Chat Mining for Gender Prediction
نویسندگان
چکیده
The aim of this paper is to investigate the feasibility of predicting the gender of a text document’s author using linguistic evidence. For this purpose, termand style-based classification techniques are evaluated over a large collection of chat messages. Prediction accuracies up to 84.2% are achieved, illustrating the applicability of these techniques to gender prediction. Moreover, the reverse problem is exploited, and the effect of gender on the writing style is discussed.
منابع مشابه
Gender Detection using Machine Learning Techniques and Delaunay Triangulation
Data mining today is being used widely in diverse areas. For example: fraudulent systems, recommender systems, disease prediction, and numerous other applications. One such application is exploited in this article. This paper presents an approach to detect gender of a person through frontal facial image, using techniques of data mining and Delaunay triangulation. Gender prediction can prove to ...
متن کاملStudy of Scalable Erudition by Using Communal Activities
Communal behavior refers to how individuals perform when they are viewing in social network Location. Load of data generated by social media like Facebook, Twitter, Flickr and YouTube present opportunity and challenges to studying communal performance in a huge range. The range of networks entails scalable learning of models for combined behavior forecast. To deal with the scalability question,...
متن کاملGendered Conversation in a Social Game-Streaming Platform
Online social media and games are increasingly replacing offline social activities. Social media is now an indispensable mode of communication; online gaming is not only a genuine social activity but also a popular spectator sport. With support for anonymity and larger audiences, online interaction shrinks social and geographical barriers. Despite such benefits, social disparities such as gende...
متن کاملClassifying Second Life Player Gender Using Chat Data
The goal of this study was to predict the genders of players of the online game Second Life using linguistic patterns from their chat data. This was accomplished using a rich set of stylistic features combined with various machine learning models. This project builds upon a previous study done at Stanford’s Virtual Human Interaction Lab in which very few linguistic features were used. Results s...
متن کاملEducational Data Mining
Computer-based learning systems can now keep detailed logs of user-system interactions, including key clicks, eye-tracking, and video data, opening up new opportunities to study how students learn with technology. Educational Data Mining (EDM; Romero, Ventura, Pechenizkiy, & Baker, 2010) is concerned with developing, researching, and applying computerized methods to detect patterns in large col...
متن کامل